When and why are log-linear models self-normalizing?

نویسندگان

  • Jacob Andreas
  • Dan Klein
چکیده

Several techniques have recently been proposed for training “self-normalized” discriminative models. These attempt to find parameter settings for which unnormalized model scores approximate the true label probability. However, the theoretical properties of such techniques (and of self-normalization generally) have not been investigated. This paper examines the conditions under which we can expect self-normalization to work. We characterize a general class of distributions that admit self-normalization, and prove generalization bounds for procedures that minimize empirical normalizer variance. Motivated by these results, we describe a novel variant of an established procedure for training self-normalized models. The new procedure avoids computing normalizers for most training examples, and decreases training time by as much as factor of ten while preserving model quality.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Normalized Log-Linear Interpolation of Backoff Language Models is Efficient

We prove that log-linearly interpolated backoff language models can be efficiently and exactly collapsed into a single normalized backoff model, contradicting Hsu (2007). While prior work reported that log-linear interpolation yields lower perplexity than linear interpolation, normalizing at query time was impractical. We normalize the model offline in advance, which is efficient due to a recur...

متن کامل

Monitoring Multinomial Logit Profiles via Log-Linear Models (Quality Engineering Conference Paper)

In certain statistical process control applications, quality of a process or product can be characterized by a function commonly referred to as profile. Some of the potential applications of profile monitoring are cases where quality characteristic of interest is modelled using binary,multinomial or ordinal variables. In this paper, profiles with multinomial response are studied. For this purpo...

متن کامل

On the Accuracy of Self-Normalized Log-Linear Models

Calculation of the log-normalizer is a major computational obstacle in applications of log-linear models with large output spaces. The problem of fast normalizer computation has therefore attracted significant attention in the theoretical and applied machine learning literature. In this paper, we analyze a recently proposed technique known as “self-normalization”, which introduces a regularizat...

متن کامل

Inspecting the mechanism: closed-form solutions for asset prices in real business cycle models

In this paper we derive closed-form solutions for a variety of prices for financial assets in an RBC economy. The equations are based on a log-linear solution of the RBC model and allow a clearer understanding of the determination of risk premia in models with production. We demonstrate not only why the premium of equity over the risk-free rate is small but also why the premium of equity over a...

متن کامل

Evaluation of prognostic factors affecting long and short term survival rates of Hodgkin's lymphoma patients using the cure fraction models

Background and Aim: This study aimed to analyze the factors affecting time and experience of relapse in the patients with Hodgkin's lymphoma, using cure fraction. Material and Methods: This retrospective study included all the patients diagnosed as Hodgkin's lymphoma in the Center for oncology and hematology in Shafa Hospital in Ahwaz City from 2002 to 2012. We used survival analysis and cure f...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015